Goto

Collaborating Authors

 nemo megatron model


Now it's NVIDIA being sued over AI copyright infringement

Engadget

This time, authors are suing NVIDIA over its AI platform NeMo, a language model that allows businesses to create and train their own chatbots, Ars Technica reported. They claim the company trained it on a controversial dataset that illegally used their books without consent. Authors Abdi Nazemian, Brian Keene and Stewart O'Nan demanded a jury trial and asked Nvidia to pay damages and destroy all copies of the Books3 dataset used to power NeMo large language models (LLMs). They claim that dataset copied a shadow library called Bibliotek consisting of 196,640 pirated books. "In sum, NVIDIA has admitted training its NeMo Megatron models on a copy of The Pile dataset," the claim states.


Deploying a 1.3B GPT-3 Model with NVIDIA NeMo Megatron

#artificialintelligence

Large language models (LLMs) are some of the most advanced deep learning algorithms that are capable of understanding written language. Many modern LLMs are built using the transformer network introduced by Google in 2017 in the Attention Is All You Need research paper. NVIDIA NeMo Megatron is an end-to-end GPU-accelerated framework for training and deploying transformer-based LLMs up to a trillion parameters. In September 2022, NVIDIA announced that NeMo Megatron is now available in Open Beta, allowing you to train and deploy LLMs using your own data. With this announcement, several pretrained checkpoints have been uploaded to HuggingFace, enabling anyone to deploy LLMs locally using GPUs.